# Multi-turn Dialogue Support
Qwen3 8B Q4 K M GGUF
Apache-2.0
A GGUF-format version of the Qwen3-8B model, suitable for the llama.cpp framework and for text generation tasks.
Large Language Model
Transformers

ufoym
342
3
Granite 3.3 8b Instruct Q8 0 GGUF
Apache-2.0
This model is a GGUF format model converted from the IBM Granite-3.3-8B instruction fine-tuned model, suitable for text generation tasks.
Large Language Model
NikolayKozloff
36
2
Gemma 2 2b It Tool Think
MIT
A text generation model fine-tuned from google/gemma-2b-it that supports reasoning about tool calls.
Large Language Model
Transformers

langdai
36
2
Qwen2.5 0.5B Instruct
Apache-2.0
A 0.5B-parameter instruction-tuned model intended for use in Gensyn's RL Swarm, supporting local fine-tuning.
Large Language Model
Transformers English

Gensyn
2.4M
5
T0 S1 14B
A 14-billion-parameter instruction-tuned large language model based on Qwen2.5-14B-Instruct and further optimized on the s1K dataset.
Large Language Model
Transformers

TomasLaz
47
2
Orpheus 3b 0.1 Ft Q6 K GGUF
Apache-2.0
This is a GGUF format model converted from canopylabs/orpheus-3b-0.1-ft, suitable for text-to-speech tasks.
Large Language Model English
TheVisitorX
191
0
Gemma 3 12b It Q5 K S GGUF
A GGUF quantized version of Google's Gemma 3 12B instruction-tuned model, suitable for local inference and text generation tasks.
Large Language Model
NikolayKozloff
16
1
Gemma 3 27b It Q4 K M GGUF
This model is a GGUF format version converted from Google's Gemma 3 27B IT model, suitable for local inference.
Large Language Model
paultimothymooney
299
2
Llama Joycaption Alpha Two Hf Llava FP8 Dynamic
MIT
This is an FP8 compressed version of the Llama JoyCaption Alpha Two model developed by fancyfeast, implemented using the llm-compressor tool and compatible with the vllm framework.
Image-to-Text English
JKCHSTR
248
1
Deepseek R1 Distill Llama 8B GGUF
A GGUF version of DeepSeek-R1-Distill-Llama-8B, an 8B-parameter reasoning model based on the Llama architecture, using 1.58-bit + 2-bit dynamic quantization to better preserve accuracy.
Large Language Model English
unsloth
37.60k
266
Internlm3 8b Instruct Gguf
Apache-2.0
The GGUF format version of the InternLM3-8B-Instruct model, suitable for the llama.cpp framework and supporting multiple quantization versions.
Large Language Model English
internlm
1,072
26
Tanuki 8B Dpo V1.0
Apache-2.0
Tanuki-8B is an 8B-parameter Japanese large language model optimized for dialogue tasks through SFT and DPO, developed by GENIAC Matsuo Lab
Large Language Model
Transformers Supports Multiple Languages

weblab-GENIAC
1,143
41
Mistral 7B Banking V2
Apache-2.0
A banking-specific large language model fine-tuned based on Mistral-7B, focusing on banking transactions and customer support scenarios
Large Language Model
Transformers

bitext
97
1
Dolphinhermespro ModelStock
Apache-2.0
This model is a hybrid created by merging the Dolphin-2.8 and Hermes-2-Pro 7B-parameter models using the LazyMerge toolkit, based on the Mistral-7B architecture.
Large Language Model
Transformers

Kquant03
14
1
Minicpm MoE 8x2B
MiniCPM-MoE-8x2B is a Transformer-based Mixture of Experts (MoE) language model, designed with 8 expert modules where each token activates 2 experts for processing.
Large Language Model
Transformers

openbmb
6,377
41
Mistral 7B OpenOrca Q4 K M GGUF
Apache-2.0
This model is a GGUF format model converted from Open-Orca/Mistral-7B-OpenOrca, suitable for text generation tasks.
Large Language Model English
munish0838
81
2
Sciphi Mistral 7B 32k
MIT
A large language model fine-tuned based on Mistral-7B-v0.1, focused on enhancing scientific reasoning and educational capabilities
Large Language Model
Transformers

SciPhi
143
72
Codellama 13b Oasst Sft V10
A version of Meta's CodeLlama 13B large language model fine-tuned by Open-Assistant. It supports English and uses a new RoPE theta value (1e6 instead of 1e4).
Large Language Model
Transformers English

OpenAssistant
159
69
Vicuna 7b V1.5
Vicuna is a chat assistant fine-tuned from Llama 2, trained on user-shared dialogues from ShareGPT.
Large Language Model
Transformers

lmsys
255.23k
335
German Gpt2
MIT
This is a German language model based on the GPT-2 architecture, specifically optimized for German text generation tasks.
Large Language Model German
anonymous-german-nlp
176
1
Distilbert Base Squad2 Custom Dataset
A DistilBERT-base model fine-tuned on SQuAD 2.0 and custom Q&A datasets, focused on efficient question answering.
Question Answering System
Transformers

superspray
17
0
SBERBANK RUS
A Russian version of GPT-2: a text generation model based on OpenAI's GPT-2 architecture, trained and optimized specifically for Russian text.
Large Language Model
Transformers Other

Mary222
16
2
Featured Recommended AI Models